Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Exploration textométrique du corpus des dossiers de Bouvard et Pécuchet

Identifieur interne : 000009 ( France/Analysis ); précédent : 000008; suivant : 000010

Exploration textométrique du corpus des dossiers de Bouvard et Pécuchet

Auteurs : Alexei Lavrentiev [France] ; Serge Heiden [France]

Source :

RBID : Hal:halshs-00678874

Descripteurs français

Abstract

This paper presents an experience of creating and analyzing a corpus of Bouvard et Pécuchet files in the methodological and technological framework of textometry, which is completely different from that of the project where these files were produced. It shows the advantages of using an open and interdisciplinary encoding system that is provided by the XML standard and the guidelines of the Text Encoding Initiative Consortium (TEI), but also points out the limits due to the extreme variability of TEI encoding practices and to the difficulty of reconciling a very precise documentary representation of the primary sources encoding practice with identifying the semantic structures relevant for textometric analysis.

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

Hal:halshs-00678874

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr">Exploration textométrique du corpus des dossiers de Bouvard et Pécuchet</title>
<author>
<name sortKey="Lavrentiev, Alexei" sort="Lavrentiev, Alexei" uniqKey="Lavrentiev A" first="Alexei" last="Lavrentiev">Alexei Lavrentiev</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-51028" status="VALID">
<idno type="IdRef">080671594</idno>
<idno type="RNSR">200311862K</idno>
<orgName>Interactions, Corpus, Apprentissages, Représentations</orgName>
<orgName type="acronym">ICAR</orgName>
<desc>
<address>
<addrLine>5, av Pierre Mendès-France 69676 BRON CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://icar.univ-lyon2.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-6818" type="direct"></relation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-300042" type="direct"></relation>
<relation name="UMR5191" active="#struct-441569" type="direct"></relation>
<relation active="#struct-303652" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-6818" type="direct">
<org type="institution" xml:id="struct-6818" status="VALID">
<idno type="IdRef">149154992</idno>
<orgName>École normale supérieure - Lyon</orgName>
<orgName type="acronym">ENS Lyon</orgName>
<desc>
<address>
<addrLine>15 parvis René Descartes - BP 7000 - 69342 Lyon Cedex 07</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-lyon.eu/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300042" type="direct">
<org type="institution" xml:id="struct-300042" status="VALID">
<orgName>INRP</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5191" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-303652" type="direct">
<org type="institution" xml:id="struct-303652" status="OLD">
<orgName>Ecole Normale Supérieure Lettres et Sciences Humaines</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Heiden, Serge" sort="Heiden, Serge" uniqKey="Heiden S" first="Serge" last="Heiden">Serge Heiden</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-51028" status="VALID">
<idno type="IdRef">080671594</idno>
<idno type="RNSR">200311862K</idno>
<orgName>Interactions, Corpus, Apprentissages, Représentations</orgName>
<orgName type="acronym">ICAR</orgName>
<desc>
<address>
<addrLine>5, av Pierre Mendès-France 69676 BRON CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://icar.univ-lyon2.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-6818" type="direct"></relation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-300042" type="direct"></relation>
<relation name="UMR5191" active="#struct-441569" type="direct"></relation>
<relation active="#struct-303652" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-6818" type="direct">
<org type="institution" xml:id="struct-6818" status="VALID">
<idno type="IdRef">149154992</idno>
<orgName>École normale supérieure - Lyon</orgName>
<orgName type="acronym">ENS Lyon</orgName>
<desc>
<address>
<addrLine>15 parvis René Descartes - BP 7000 - 69342 Lyon Cedex 07</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-lyon.eu/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300042" type="direct">
<org type="institution" xml:id="struct-300042" status="VALID">
<orgName>INRP</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5191" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-303652" type="direct">
<org type="institution" xml:id="struct-303652" status="OLD">
<orgName>Ecole Normale Supérieure Lettres et Sciences Humaines</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:halshs-00678874</idno>
<idno type="halId">halshs-00678874</idno>
<idno type="halUri">https://halshs.archives-ouvertes.fr/halshs-00678874</idno>
<idno type="url">https://halshs.archives-ouvertes.fr/halshs-00678874</idno>
<date when="2014-03-07">2014-03-07</date>
<idno type="wicri:Area/Hal/Corpus">000030</idno>
<idno type="wicri:Area/Hal/Curation">000030</idno>
<idno type="wicri:Area/Hal/Checkpoint">000010</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">000010</idno>
<idno type="wicri:Area/Main/Merge">000010</idno>
<idno type="wicri:Area/Main/Curation">000010</idno>
<idno type="wicri:Area/Main/Exploration">000010</idno>
<idno type="wicri:Area/France/Extraction">000009</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="fr">Exploration textométrique du corpus des dossiers de Bouvard et Pécuchet</title>
<author>
<name sortKey="Lavrentiev, Alexei" sort="Lavrentiev, Alexei" uniqKey="Lavrentiev A" first="Alexei" last="Lavrentiev">Alexei Lavrentiev</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-51028" status="VALID">
<idno type="IdRef">080671594</idno>
<idno type="RNSR">200311862K</idno>
<orgName>Interactions, Corpus, Apprentissages, Représentations</orgName>
<orgName type="acronym">ICAR</orgName>
<desc>
<address>
<addrLine>5, av Pierre Mendès-France 69676 BRON CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://icar.univ-lyon2.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-6818" type="direct"></relation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-300042" type="direct"></relation>
<relation name="UMR5191" active="#struct-441569" type="direct"></relation>
<relation active="#struct-303652" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-6818" type="direct">
<org type="institution" xml:id="struct-6818" status="VALID">
<idno type="IdRef">149154992</idno>
<orgName>École normale supérieure - Lyon</orgName>
<orgName type="acronym">ENS Lyon</orgName>
<desc>
<address>
<addrLine>15 parvis René Descartes - BP 7000 - 69342 Lyon Cedex 07</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-lyon.eu/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300042" type="direct">
<org type="institution" xml:id="struct-300042" status="VALID">
<orgName>INRP</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5191" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-303652" type="direct">
<org type="institution" xml:id="struct-303652" status="OLD">
<orgName>Ecole Normale Supérieure Lettres et Sciences Humaines</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Heiden, Serge" sort="Heiden, Serge" uniqKey="Heiden S" first="Serge" last="Heiden">Serge Heiden</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-51028" status="VALID">
<idno type="IdRef">080671594</idno>
<idno type="RNSR">200311862K</idno>
<orgName>Interactions, Corpus, Apprentissages, Représentations</orgName>
<orgName type="acronym">ICAR</orgName>
<desc>
<address>
<addrLine>5, av Pierre Mendès-France 69676 BRON CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://icar.univ-lyon2.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-6818" type="direct"></relation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-300042" type="direct"></relation>
<relation name="UMR5191" active="#struct-441569" type="direct"></relation>
<relation active="#struct-303652" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-6818" type="direct">
<org type="institution" xml:id="struct-6818" status="VALID">
<idno type="IdRef">149154992</idno>
<orgName>École normale supérieure - Lyon</orgName>
<orgName type="acronym">ENS Lyon</orgName>
<desc>
<address>
<addrLine>15 parvis René Descartes - BP 7000 - 69342 Lyon Cedex 07</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-lyon.eu/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300042" type="direct">
<org type="institution" xml:id="struct-300042" status="VALID">
<orgName>INRP</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5191" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-303652" type="direct">
<org type="institution" xml:id="struct-303652" status="OLD">
<orgName>Ecole Normale Supérieure Lettres et Sciences Humaines</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="fr">
<term>Bouvard et Pécuchet</term>
<term>Flaubert</term>
<term>humanités numériques</term>
<term>textométrie</term>
<term>édition électronique</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>édition électronique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper presents an experience of creating and analyzing a corpus of Bouvard et Pécuchet files in the methodological and technological framework of textometry, which is completely different from that of the project where these files were produced. It shows the advantages of using an open and interdisciplinary encoding system that is provided by the XML standard and the guidelines of the Text Encoding Initiative Consortium (TEI), but also points out the limits due to the extreme variability of TEI encoding practices and to the difficulty of reconciling a very precise documentary representation of the primary sources encoding practice with identifying the semantic structures relevant for textometric analysis.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
</list>
<tree>
<country name="France">
<noRegion>
<name sortKey="Lavrentiev, Alexei" sort="Lavrentiev, Alexei" uniqKey="Lavrentiev A" first="Alexei" last="Lavrentiev">Alexei Lavrentiev</name>
</noRegion>
<name sortKey="Heiden, Serge" sort="Heiden, Serge" uniqKey="Heiden S" first="Serge" last="Heiden">Serge Heiden</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/France/Analysis
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000009 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/France/Analysis/biblio.hfd -nk 000009 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    France
   |étape=   Analysis
   |type=    RBID
   |clé=     Hal:halshs-00678874
   |texte=   Exploration textométrique du corpus des dossiers de Bouvard et Pécuchet
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024